Benefiting from Disorder: Source Coding for Unordered Data
Authors
Abstract
The order of letters is not always relevant in a communication task. This paper discusses the implications of order irrelevance on source coding, presenting results in several major branches of source coding theory: lossless coding, universal lossless coding, rate-distortion, high-rate quantization, and universal lossy coding. The main conclusions demonstrate that there are significant rate savings when order is irrelevant. In particular, lossless coding of n letters from a finite alphabet requires Θ(log n) bits and universal lossless coding requires n + o(n) bits for many countable alphabet sources. However, there are no universal schemes that can drive a strong redundancy measure to zero. Results for lossy coding include distribution-free expressions for the rate savings from order irrelevance in various high-rate quantization schemes. Rate-distortion bounds are given, and it is shown that the analogue of the Shannon lower bound is loose at all finite rates.
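To give a feel for the Θ(log n) figure, the following Python sketch compares the number of bits needed to index every ordered sequence of n letters with the number needed to index every multiset (letter-count vector) of n letters. It is a simple counting illustration under the assumption of a fixed-length index over a fixed alphabet of size k, not the coding scheme from the paper; the function names `ordered_bits`, `multiset_bits`, and `type_of` are hypothetical.

```python
from collections import Counter
from math import comb


def ordered_bits(n: int, k: int) -> int:
    """Bits for a fixed-length index over all k**n ordered sequences."""
    return (k**n - 1).bit_length()


def multiset_bits(n: int, k: int) -> int:
    """Bits for a fixed-length index over all multisets of size n.

    There are C(n + k - 1, k - 1) multisets of n letters drawn from
    k symbols; for fixed k this count is a polynomial in n, so its
    logarithm grows like (k - 1) * log2(n).
    """
    return (comb(n + k - 1, k - 1) - 1).bit_length()


def type_of(seq: str, alphabet: str) -> tuple:
    """Letter counts of seq: everything that survives when order is dropped."""
    counts = Counter(seq)
    return tuple(counts[a] for a in alphabet)


if __name__ == "__main__":
    for n in (10, 100, 1000):
        print(n, ordered_bits(n, 2), multiset_bits(n, 2))
    # Two orderings of the same letters map to the same count vector.
    print(type_of("abba", "ab") == type_of("baab", "ab"))
```

For a binary alphabet the ordered index grows linearly in n, while the multiset index grows only logarithmically, since just the count of one symbol needs to be conveyed.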
Similar resources
A Study of Coding Accuracy in the Teaching Hospitals of Shiraz University of Medical Sciences and Health Services
The research was intended to determine the rate of coding accuracy in the training hospitals of Shiraz University of Medical Sciences and Health Treatment Services in 1995 (1374), and it was performed through a descriptive-analytic method. In the research, 400 medical records were selected based on stratified sampling method from among records of the patients having been discharged from hospita...
Image Classification via Sparse Representation and Subspace Alignment
Image representation is a crucial problem in image processing, where there exist many low-level representations of images, e.g., SIFT, HOG and so on. But there is a missing link between low-level and high-level semantic representations. In fact, traditional machine learning approaches, e.g., non-negative matrix factorization, sparse representation and principal component analysis are employed to d...
Network Coding for Energy-Efficient Distributed Storage System in Wireless Sensor Networks
A network coding-based scheme is proposed to improve the energy efficiency of distributed storage systems in WSNs (wireless sensor networks), which mainly focuses on two problems: firstly, consideration is given to effective distributed storage technology in WSNs; secondly, we address how to repair the data in failed storage nodes with fewer resources. For the first problem, we propose a method t...
Coding over Sets for DNA Storage
In this paper we study error-correcting codes for the storage of data in synthetic DNA. We investigate a storage model where a data set is represented by an unordered set of M sequences, each of length L. Errors within that model are losses of whole sequences and point errors inside the sequences, such as insertions, deletions and substitutions. We propose code constructions which can correct e...
Recognizing Stalactites and Stalagmites of ELT Vodcasting in Teacher Education Based on Activity Theory and Visual Thinking Strategies
Teachers need to know new applications for developing online materials, and they should then become aware of different ways to present them to an online audience. This study tries to help ELT teachers to overcome challenges and difficulties of vodcasting by recognizing their social and biological motives (Activity Theory), and using Visual Thinking Strategies (VTS). It can alter teachers’ negative ...
Journal title:
- CoRR
Volume abs/0708.2310, Issue
Pages -
Publication date 2007